Quantum query complexity of some graph problems* 

Christoph Diirr^ Mark Heiligman-t Peter H0yer§ Mehdi Mhalla^ 

January 15, 2004 



Abstract 

Quantum algorithms for graph problems are considered, both in the adjacency matrix model and 
in an adjacency list-like array model. We give almost tight lower and upper bounds for the bounded 
error quantum query complexity of Connectivity, Strong Connectivity, Minimum Spanning Tree, 
and Single Source Shortest Paths. For example we show that the query complexity of Minimum 
Spanning Tree is in Q{n'^^^) in the matrix model and in 0(-^/mn) in the array model, while the 
complexity of Connectivity is also in Q{n^^^) in the matrix model, but in 0(n) in the array model. 
The upper bounds utilize search procedures for finding minima of functions under various conditions. 

Keywords: graph theory, quantum algorithm, lower bound, connectivity, minimum spanning tree, single 
source shortest paths 

1 Introduction 

A primary goal of the theory of quantum complexity is to determine when quantum computers may offer a 
computational speed-up over classical computers. Today there are only a few results which give a polynomial 
time quantum algorithm for some problem for which no classical polynomial time solution is known. We 
are interested in studying the potentialities for speed-up for problems for which there already are efficient 
classical algorithms. Basic graphs problems are interesting candidates. 

We study the query complexity of these problems; meaning the minimal number of queries to the graph 
required for solving the problem. Throughout this paper, the symbol [n] denotes the set [0..n — 1]. We 
consider two query models for directed graphs: 

The adjacency matrix model, where the graph is given as the adjacency matrix M e {0, 1}"^"^ with 
Mij = 1 if and only if (v^, Vj) e E. 

The adjacency array model, where we are given the out-degrees of the vertices , . . . and for every 
vertex u an array with its neighbors fi : [df] [n]. So fi{j) returns the j*^ neighbor of vertex i, 
according to some arbitrary but fixed numbering of the outgoing edges of i. In this paper the upper 
bounds for this model are all at least n, so we assume henceforth that the degrees are given as part of 
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the input and we account only queries to the arrays /,;. In addition the arrays satisfy the simple graph 
promise 

Vze M,j,/e [fc],j^/:/,(j)^/,(/) 

ensuring the graph is not a multigraph, i.e. does not have multiple edges between any two vertices. 

For undirected graphs we require an additional promise on the input, namely that M is symmetric in the 
matrix model, and for the array model that Vi, i' £ [n] if 3j E [k] : fi{j) ~ i' then 3j' £ [k] : fi' [j') = i. Note 
that in the matrix model this symmetry assumption does not make undirected graph problems, promise 
problems since we may assume that the input is upper triangular. 

Weighted graphs are encoded by a weight matrix, where for convenience we set My = oo if {vi,Vj) ^ E. 
In the adjacency array model, the graph is encoded by a sequence of functions fi : [df] — > [n] x N, such that 
if fiU) = {'>'' t'w) then there is an edge {vi,Vi') and it has weight w. 

We emphasize that the array model is different from the standard list model. In the latter, we have 
access to the neighbors of a given vertex only as a list, and thus querying the i^^ neighbor requires i accesses 
to the list. This is also true on a quantum computer, so its speedup is quite restricted. 

Many other query models are of course possible, for example we could be given an array of edges / : 
[m] — > [n] X [71], or an ordered array (which is up to 0(ri) preprocessing the same as the adjacency array 
model). For simplicity, we use the array model as presented above. 

For the quantum query complexity of general monotone graph properties, a lower bound of fl{y/n) is 
known in the matrix model, as shown by Buhrman, Cleve, de Wolf and Zalka We are not aware of any 
quantum nor classical lower bounds in the array model. 

In this paper we show that the quantum query complexity of Connectivity is 9(n'^/^) in the matrix 
model and <d{n) in the array model. The classical randomized query complexity of Connectivity in the 
matrix model is f2(n^) by a sensitivity argument: Distinguishing the graph consisting of two length n/2 
paths from the graph consisting of those two paths, plus an additional edge connecting them, fi(ri^) queries 
are required. 

We study the complexity of three other problems. In Strong Connectivity we are given a directed 
graph and have to decide if there is a directed path between any pair of vertices. In Minimum Spanning 
Tree we are given a weighted graph and have to compute a spanning tree with minimal total edge weight. In 
Single Source Shortest Paths we have to compute the shortest paths according to the total edge weight 
from a given source vertex to every other vertex. The quantum query complexity of these three problems is 
r2(n'^/^) in the matrix model and il{y'nni) in the array model. We give almost tight upper bounds. 

problem matrix model array model 

minimum spanning tree Q{n^^^) Q{^nm) 

connectivity 6(71^/^) 6(n) 

strong connectivity 9(n'^/^) n{y'nm), 0{y/nm logn) 

single src. short, paths Q{n^/^), 0{n^^^ \og'^ n) fl{^/nTn), 0{^/nm\og'^ n) 



Table 1: Quantum query complexity of some graph problems 

We note that for graphs with a large number of edges (m = 0(n^)), the complexities are (almost) the 
same in the matrix and array model for all problems but Connectivity. However, the models still differ in 

that case. For example the test (u, v) £ E costs a single query in the matrix model and 0{\J min{c?u , dt}) 
queries in the array model since wc do not assume any order on the arrays /„ and f^. 

The time complexities of the algorithms are the same as the query complexities up to log-factors. The 
algorithms given for connectivity and strong connectivity can be altered to also output the (strongly) con- 
nected components without increasing the asymptotic complexity. The space requirement is O(logn) qubits 

^In a previous version of this paper, we said that Buhrman et al. conjectured f2(n) for Connectivity, and since their 
conjecture concerns arbitrary monotone graph properties in general, we gave a false impression of improving their result. We 
apologize. 
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and O(nlogn) classical bits. If we constraint the space (both classical and quantum) to O(logn) qubits, 
the problems may be solved by random walks. Quantum random walks has been the subject of several 
papers P^El: in particular for the si-CONNECTiviTY problem Q2i- 

Other work on the query complexity of graphs problems has been done independently of us. For example 

01 shows that testing whether a given graph is bi-partite needs fl{n^/^) queries in the matrix model. Note 
that a graph is bi-partite iff it does not contain an odd cycle. For the array model a lower bound can 
be constructed from our lower bound for connectivity, showing that fl(y^nm) queries are necessary for any 
bounded error quantum algorithm which distinguishes a single even cycle from two disjoint odd cycles. 
Matching upper bounds follow from our connectivity algorithms: simply construct a spanning forest of the 
graph. Then color alternating black-white the nodes of each tree. And finally use the quantum search 
procedure to find an edge with endpoints of the same color. Such an edge creates an odd cycle, and exists 
iff the graph contains an odd cycle. 

2 Tools used in this paper 

We use two fundamental tools. The first is amplitude amplification 0|S1, which we use when proving the 
upper bounds, the second is Ambainis' lower bound technique 

Amplitude amplification is a generalization of Lov Grover's search algorithm |13| . Since it is the most 
important tool used in our algorithms, we restate the exact results we require. We are given a boolean 
function F defined on a domain of size n. The function is given as a black box so that the only way we can 
obtain information about F is via evaluating F on elements in the domain. The search problem considered 
by Grover is to find an element x for which F{x) — 1, provided one exists. We say that a; is a solution to 
the search problem, and that x is good. We use three generalizations of the search algorithm — all of which 
we refer to as "the search algorithm" . 

• If there are t elements mapped to 1 under F, with t > 0, the search algorithm returns a solution after 
an expected number of at most j^^jriji queries to F. The output of the algorithm is chosen uniformly 
at random among the t solutions. The algorithm does not require prior knowledge of t ISj. 

• A second version uses 0{^n) queries to F in the worst case and outputs a solution with probability 
at least a constant, provided there is one [H]. 

• A third version uses 0{^n\ogl/ e) queries to F and finds a solution with probability at least 1 — e, 
provided there is one jH]. 

We note that for very sparse graphs given in the adjacency matrix model, it may for some applications 
be efficient to initially learn all entries of the matrix by reiterating the first version of the search algorithm, 
for instance as formalized in FactQ] 

Fact 1 Let k he given. There is a quantum algorithm that takes as input an n x n boolean matrix M , uses 
0{ny/k) queries to M , and outputs a set S of 1-entries of M so that with probability at least ^, S is of 
cardinality at least k or contains all 1-entries of M in case M has less than k 1-entries. 

Our lower bounds uses a technique introduced by Andris Ambainis. 

Theorem 2 ([2l theorem 6]) Let L C {0,1}* be a decision problem. Let X C L be a set of positive 
instances and Y (- L a set of negative instances. Let R C X x Y be a relation between instances of same 
size. Let values m, m', £xa and £'y j for x,y € {0, 1}" and x G X , y ^Y , i ^ [n] be so that 

• for every x ^ X there are at least m different y Cz Y in relation with x, 

• for every y (z Y there are at least m! different x €z X in relation with y, 
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• for every x G X and i G [n] there are at most £x,i different y G Y in relation with x which differ from 
X at entry i, 

• for every y € Y and i G [n] there are at most I'y ^ different x € X in relation with y which differ from 
y at entry i. 

Then the quantum query complexity of L is mm' / (.^aa^ where (.-max is the maximum of ixA^'yi subject 
to xRy and Xi ^ yi- 

3 Minima finding 

Many graph problems arc optimization problems, as are finding a minimum spanning tree, single source 
shortest paths, and largest connected components. Most quantum algorithms for such optimization problems 
utilize the search algorithm discussed above. A very basic and abstract optimization problem is as follows. 
Suppose we are given a function / defined on a domain of size n, and we want to find an index i so that f{i) 
is a minimum in the image of /. This minimization problem was considered in which gives an optimal 
quantum algorithm that uses 0{^Jn) queries to / and finds such an i with constant probability. It is very 
simple to analyze, and we present it now, since it will make the rest of the section easier to understand. 

1. Initially let j e [N] be an index chosen uniformly at random. 

2. Repeat forever 

(a) Find an index i £ [N] such that f{i) < f{j). 

(b) Set j := i. 

Theorem 3 f jllp The expected number of queries to f, until j contains the index of a minimum in the 
image of f is 0{\/N). 

Proof: Without loss of generality assume that / is injective. Now every index j S [N] has a rank, which we 
define as the number of indexes i such that f{i) < f{j). 

Let pr be the probability that at some moment of the algorithm, the index j will have rank r. We 
claim that Pr = \/r. Consider the first moment when j will have rank less or equal r. This moment will 
happen with probability 1. At that moment, because of the uniform choice in step 1 and since step 2(a) uses 
the quantum search procedure, j will be uniformly chosen among all indexes with rank less or equal r, so 
Pr = !/?'• 

If j has rank r, the search procedure of step 2(a) will require N/{r — 1) expected number of queries, 
for some constant c. Therefore the total number of queries until j contains the solution is 

JV N-l / N-l \ 

r=2 r=l \ "'''=1 / 

□ 

Let c'^/N be the expected number of queries to / until j contains the index of a minima. Stopping the 
algorithm after 2c' queries gives a quantum algorithm with error probability upper bounded by 1/2. 

For the purposes of this paper, we require the following generalizations of the minimum finding problem, 
illustrated in Figure ^ 

Problem 1 (Find d smallest values of a function) Let N* denote N U {oo}. Given function f : [N] 
N* and an integer d G [N], we wish to find d distinct indexes mapping to smallest values, i.e. a subset I C [N] 
of cardinality d such that for any j €E [N] \ I we have that f{i) < f{j) for all i € /. 
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In the rest of this section, we assume d < N/2. In the foUowing problem we are given a different function 
g : [N] N, such that g{j) defines the type of j. Let e — \{g{j) ■ j <S [N]}\ be the number of different types. 

Problem 2 (Find d elements of different type) Given function g and an integer d' we wish to find 
integer d ~ minjd', e} and a subset I C [N] of cardinality d such that g{i) ^ g(i') for all distinct i, i' G I ■ 

Now we present a generalization of both previous problems. 

Problem 3 (Find d smallest values of different type) Given two functions f, g and an integer d' we 
wish to find integer d = minjcJ', e} and a subset I C [A^] of cardinality d such that g(i) ^ g{i') for all distinct 
i,i' Cz I and such that for all j G [N] \ I and i Cz I , if f{j) < f{i) then f(i') < /(j) for some i' G / with 



4 points of min. value 4 points of diff. type 4 points of min. value 

and diff. type 



Figure 1: Illustration of the three problems. Each index i is illustrated by a point with horizontal coordinate 
g{i) and vertical coordinate /(«). 

It is clear that Problems ^ and [5] are special cases of Problem|2| In this section, we give an upper bound 
of 0{^/dN) for Problem 13 In Section |S1 we then show a lower bound of Q{\/dN) for Problems ^ and [3 
implying that all three problems are of complexity Q{V dN). We prove the upper bound by a simple greedy 
algorithm. Consider a subset / C [A^] of d indices of different types. We say an index j G [A^] is good for I if 

1. either g{j) = g{i) and /(j) < f{i) for some i G /, 

2. or g{i) ^ g{I) and /(j) < f{i) for some z G /. 

In the former case we say j is a good index of known type, in the latter that j is a good index of unknown 
type. In each iteration of the greedy algorithm, we find a good index j by the search algorithm and then 
improve / by replacing some index in / by j. 

1. Initially, let / = {N + 1, . . . , iV + d'} be a set of artificial indices of unique different types and unique 
maximal value. 

2. Repeat forever 

(a) Let t denote the number of good elements for /. (Note: This step is not required, but only 
included for the purpose of simplifying the analysis of the algorithm.) 

(b) Use the first version of the search algorithm to find a good element j G [N] for /. 

(c) Set / = improve(/, j) where we improve / by replacing with j the element in / that has the same 
type as j if j is of known type, and by replacing with j some element in / with largest /-value if 
j is of unknown type. 

The next lemma shows we only need an expected number of 0{d) iterations of the main loop to eliminate 
a constant fraction of the remaining good elements. 
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Lemma 4 Let I C [N] be any subset of dl indices of different types with t > good elements of e types. 
After an expected number of 0{d) iterations of the main loop there are at most |t good elements for I . Here 
d — min{d', e}. 

Proof: For notational simplicity assume / is injective. Set Iq — I and let Tq ~ T he the set of good 
elements for /. Let Tj denote the set of good elements after j iterations of the main loop, for j > 0. Similarly, 
let Ij denote the selected index-set after j iterations, for j > 0. Set tk = |Tfc|. In particular Iq = I and 
to = t. Let ^niid denote the [t/2j*^ smallest of the t elements according to /. For any subset 5 C [N + d'], 
let low(S') denote the number of elements in S that are no bigger than ymid according to /. 

Note that initially 

• low(ro) = [t/2\ and 

• low(/o) < d. 

By the nature of the greedy algorithm, low(rfc+i) < low(rfc) and low(/fc+i) > low(/fc) for any A; > 0. Note 
that 

• if low(Tfe) < J, then we have eliminated at least a fraction of -j of the initially t good elements for /, 
and similarly, 

• if low(//j ) = d, then we have eliminated at least a fraction of i of the initially t good elements for /. 

We claim that in each iteration of the main loop, as long as low(Tfc) > j, with probability at least at 
least one of the following two events happens 

. low(Tfc+i) <low(Tfe)(l- gi^), 

• 1ow(/a;+i) = low(/fc) + 1. 

Assume \ow{Tk) > |, since otherwise we are done. Consider the element j picked in Step I2bl First 
suppose the majority of the low(Tfc) indices are of unknown type with respect to Ik- Then, with probability 
at least |, index j is among these, in which case low(/fc+i) = low(/fc) + 1. 

Now suppose the majority of the low(rfc) indices are of known type with respect to Ik- Then, with 
probability at least i, index j is among these. Conditioned on this happens, with probability at least ^, 
there are at least '"'^^J''-* good elements for Ik of the same type as j. With probability at least i, at least 
half of these arc not good for Ik+i- Thus, with probability at least we have eliminated at least of 
the remaining elements in Tj- 

This proves the claim. It follows that after an expected number of 0{d) iterations of the main loop, we 
have eliminated at least a fraction of j of the initially t good elements. □ 

The above lemma implies that, for t > 2d, after an expected number of 0{dy/ N/t ) applications of 
function /, the number of good elements is at most |. Hence, for any t > 2d, the expected number 
applications of function / required till we have that t < 2d for the first time is in the order of 

e 0{VdN). 

Once t < 2d for the first time, the expected number of applications of / required before t ~ for the first 
time is in the order of V ^ 1 3 which is in 0(V dN ). 

Corollary 5 In the greedy algorithm given above, after an expected number of 0{-\/dN) applications of 
function f , there are no good elements for I , that is, t = 0. 

The next theorem follows immediately. 
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Theorem 6 The problem Find d Smallest Values of Different Type has bounded error quantum 
query complexity 0{^fdN). 

Nayak and Wu give in \^ a bounded error quantum algorithm that given a function / : [N] — * N and 
two integers d and A, outputs an index i such that the rank of f{i) is between d — A and c? + A. The query 
complexity of their algorithm is 0(M log M log log M) where M = ^JN/IS. + yJd{N - d)/A. Setting A = i 
it would find the d"^ smallest element with 0{V dN log N log log N) queries. Nayak ^H] later improved this 
algorithm to 0{\/ dN), matching the lower bound given in His method is different from ours. 



Remark 1 The algorithm above uses cV dN queries for some constant c and outputs the solution with 
probability at least 1/2. In order to reduce the error probability to \/2^ one could run the algorithm k times 
and among the dk resulting indices, output the d smallest values of different type. However starting each run 
with randomly chosen points of different type regardless of the previous outcome, is a waste of information. 
So it is much more clever to run the algorithm only once and stop it after kc\J dN queries. 

4 Minimum Spanning Tree 

In this section we consider undirected graphs with weighted edges. In Minimum Spanning Tree we wish 
to compute a cycle- free edge set of maximal cardinality that has minimum total weight. To be precise if the 
graph is not connected this is actually a spanning forest. 

Classically, there are a number of different approaches to finding minimum spanning trees efficiently, 
including the algorithms of Boruvka [SJ 150] , Kruskal QT] , and Prim [22 . To construct an efficient quantum 
algorithm, we use Boriivka's algorithm since it is of a highly parallel nature. This allows us to use the minima 
finding algorithms given in Section 13 

Boruvka's algorithm consists of at most logn iterations. In brief, initially it starts with a collection of n 
spanning trees, each tree containing a single vertex. In each iteration, it finds a minimum weight edge out of 
each tree in the collection, adds the edges to the trees, and merges them into larger and fewer trees. After 
at most logn iterations, there is only one tree left, which is a minimum spanning tree. The correctness of 
Boruvka's algorithm rests on the following simple fact about spanning trees. 

Fact 7 Let U d V be a set of vertices of a connected graph G = (V, E) and let e be a minimum weight edge 
of (U X U) D E. Then there is a minimum spanning tree containing e. 

In our quantum version of Boruvka's algorithm, we make a few adjustments to keep the overall error 
probability small without sacrificing in the number of queries. We adjust it slightly so that the t"^ iteration 
errs with probability at most 2fT2 j ensuring that the overall error is at most j . This increases the cost of the 

iteration by a factor of I, but since the cost of the first few iterations dominates, this is asymptotically 
negligible. The details follow. 

1. Let Ti,T2t . ■ ,Tk he a spanning forest. Initially, k = n and each tree Tj contains a single vertex. 

2. Set ^ = 0. 

3. Repeat until there is only a single spanning tree (i.e., k = 1). 

(a) Increment ^. 

(b) Find edges ei, 62, . . . , satisfying that Cj is a minimum weight edge leaving Tj. Interrupt when 
the total number of queries is (£ + 2)c\/km for some appropriate constant c. 

(c) Add the edges to the trees, merging them into larger trees. 

4. Return the spanning tree Ti. 
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To find the minimum edges ei, . . . ,efc in Step I3bl we use the following functions. In the array model, 
any edge {u, v) is coded twice, u appears as neighbor of v, but v also appears as neighbor of u. Enumerate 
the directed edges from to 2m — 1. Let / : [2m] N* denote the function that maps every directed edge 
{u, v) to its weight if u and v belong to different trees of the current spanning forest and to oo otherwise. 
Let g : [2m] — > [k] denote the function that maps every directed edge {u, v) to the index j of the tree 
Tj containing u. We then apply the algorithm for Finding k smallest values of different type, 
interrupting it after {£ + 2)cVkm queries to obtain an error probability at most 1/2^+^, see Remark^ 

Theorem 8 Given an undirected graph with weighted edges, the algorithm above outputs a spanning tree 
that is minimum with probability at least i. The algorithm uses 0{^nm) queries in the array model and 
0{n^/^) queries in the matrix model. 

Proof: To simplify the proof, consider the matrix model an instance of the array model with m = n{n — 1) 
edges. 

At the beginning of the i^^ iteration of the main loop, the number of trees k is at most n/2^~^, and thus 
it uses at most {£ + 2)c\J nm/2^~^ queries. Summing over all iterations, the total number of queries is at 
most X)f>i(^ + 2)c-\/ nm/2^~i, which is in 0{y/nm). 

The iteration introduces an error with probability at most 21^2 ■ The overall error probability is thus 
upper bounded by Y.tyi ^ \- ^ 

For our algorithms, a O(logn) factor applies when considering the bit computational model, rather 
than the algebraic computational model. Apart from that, the time complexity is the same as the query 
complexity by using an appropriate data structure. Each vertex holds a pointer to another vertex in the 
same component, except for a unique vertex per component that holds a null pointer. This vertex is called 
the canonical representative of the component. To decide if two vertices are in the same component, we need 
only to determine the canonical representative of each vertex by pointer chasing. To merge two components, 
we change the pointer of one of the canonical representative to point to the other. 

Using pointer chasing, the time complexity of the iteration is a factor of t larger than its query 
complexity. However, as in the case of the error reduction, this is insignificant: X]«>i ^(^ + 2)c-\/ nm/2^~^ is 
also in 0{^/nm). Thus, the time complexity is asymptotically the same as the query complexity. 

5 Connectivity 

A special case of Minimum Spanning Tree when aU edge weights are equal, is Graph Connectivity. 
The input is an undirected graph and the output is a spanning tree, provided the graph is connected. 

For the matrix model, the algorithm for minimum spanning tree given in the previous section implies 
an 0(n^/^) upper bound for graph connectivity as well. Below, we give a somewhat simpler and arguably 
more natural quantum algorithm of query complexity 0{n^^'^), which is optimal by the lower bound given 
in Section IHl below. 

For the array model, we give a quantum algorithm that uses only 0{n) queries. Both algorithms start 
with a collection of n connected components, one for each vertex, and greedily construct a spanning tree by 
repeatedly picking an edge that connects two of the components. 

Theorem 9 Given the adjacency matrix M of an undirected graph G, the algorithm below outputs a spanning 
tree for G after an expected number of 0{n^^^) queries to M, provided G is connected, and otherwise runs 
forever. 

Proof: Consider the following algorithm. 

1. hiitially the edge set A is empty. 

2. Repeat until A connects the graph. 
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(a) Search for a good edge, i.e., an edge that connects two different components in A, and add it 
to A. Use the version of the search algorithm that returns a sohition in expected 0(v? jt) queries 
if there are t > good edges and otherwise runs forever. 

3. Return the edge set A. 

Suppose the graph is connected and consider the expected total number of queries used by the algorithm. 
There are exactly n — 1 iterations of the main loop. The number of good edges is at least fc — 1 when A 
consists of k components, and thus the expected total number of queries is in the order of X]a:=2 V I ^ l); 
which is in Oir?!"^). □ 

When implementing the above algorithm, wc maintain an appropriate data structure containing informa- 
tion about the connected components in the graph induced by A. This introduces an additional O(nlogn) 
term in the running time of the algorithm which is negligible compared to 0(n'^/^). We may choose to stop 
the algorithm after twice the expected total number of queries, giving an 0[r?l^^ query algorithm with 
bounded one-sided error. 



5.1 The array model 

Lemma 10 Given an undirected graph G in the array model, we can in 0{n) classical queries construct a 
set of connected components {Ci, . . . , Ck} for some integer k, so that for each component G , its total degree 
mc — 'Yliiec'^i ^'^ more than |Cp. 

Proof: The algorithm is classical and is as follows. 

1. Initially the edge set A is empty. 

2. Let 5 = be the set of vertices not yet placed in some component. 

3. Let fc = be the number of components constructed thus far. 

4. While S is non-empty 

(a) Take the vertex v of highest degree in S and set D = {v}. 

(b) Go through w's list of neighbors one by one. each time adding the neighbor w to D and the edge 
{v,w) to A, until one of two events happens: (1) Wc reach the end of the list, or (2) we reach a 
neighbor w already assigned to some component Gj with j < k. 

(c) In case (1), set k = k + 1, Gk = D, and remove D from S. In case (2), add D to Cj, and remove 
D from S. 

5. Output fc, A, and Ci, C2, . . . , Cfc. 

The algorithm uses n — k queries in total, one query for each vertex but the first added to each component. 
Edge set A contains the union of spanning trees of the components Ci through Ck- 

To show correctness, let v be the vertex chosen in Step I4al and d its degree. Then d < \Cj\ for each 
components constructed so far, since the size of a freshly created component is the degree of one of its 
vertices, which by choice in Step I4al must be no less than d, and components can only grow. To show that 
the total degree of every component Gj is no more than |CjP, consider the two cases in Step I4bl 

In case (1), D is the set of v and its neighbors, each neighbor having degree no larger than d, implying 
the total degree is at most d{d + 1) which is strictly less than {d + 1)^ = I^P- In case (2), let a be the size 
of the component Gj to which D is merged, and b the size of D. Then b < d < a. The total degree is no 
more than + bd which is strictly less than {a + b)'^ . □ 

Theorem 11 Given an undirected graph G in the array model, the algorithm below outputs a spanning tree 
for G using an expected number of 0{n) queries, provided G is connected, and otherwise runs forever. 
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Proof: Consider the following algorithm. 



1. Construct the edge set A using the above lemma. 

2. Repeat until A connects the graph. 

(a) Pick a connected component C in A with smallest total degree, i.e, a component minimizing 

(b) Search for an edge out of C, i.e., an edge that connects C to some other component in A, and 
add it to A. Use the version of the search algorithm that returns a solution in expected 0{y/mc) 
queries if there is at least one such edge and otherwise runs forever. 

3. Return the edge set A. 

Suppose the graph is connected and consider the expected total number of queries used by the algorithm. 

We first construct k components, each component C having total degree mc at most |Cp. In each 
iteration of the main loop, we pick the component with smallest total degree and search for an edge out 
of C. The expected cost of finding such an edge is at most a^fmc for some constant a. We distribute this 
cost evenly among each of the mc edge endpoints in C, each endpoint paying a/ ^mc- 

Fix an arbitrary edge endpoint. Enumerate from up to at most logm the successive components that 
were chosen by the algorithm for a search and that contain this fixed edge endpoint. Let be the number of 
edge endpoint in the i'^ component. Then m^+i > 2mi. The total cost assigned to our fixed edge endpoint 
is upper bounded by 

log m log m 

Let C be any of the k components constructed in the first step. The total cost assigned over all edge 
endpoints in C is thus upper bounded by 4a^mc, which is at most 4a|C|. Summing over all k components, 
the total cost assigned in the main loop is at most 4an, which is linear in n. □ 



6 Strong Connectivity 

We give two quantum algorithms for strong connectivity, first one for the matrix model and then one for the 
array model. The input is a directed graph and the output is a set of at most 2(n — 1) edges that proves the 
graph is strongly connected, provided it is. It follows from the discussions below that such sets always exist. 

Theorem 12 Given the adjacency matrix M of a directed graph G and a vertex vq, the algorithm below 
uses 0{y/nm logn) queries to M and outputs a directed tree A C E rooted at vq. With probability at least 
j^, A spans all of G , provided such a spanning tree exists. 

Proof: Consider the following simple algorithm. 

1. Initially the edge set A is empty. 

2. Let S = {t'o} be a set of reachable vertices, and T = {vq} a stack of vertices to be processed. 

3. While T 7^ {} do 

(a) Let u be the top most vertex of stack T. 

(b) Search for a neighbor v of u not in S. Use the version of the search algorithm that uses 
0{\/ dti log 71 ) queries and outputs a solution with probability at least 1 — provided one 
exists. 

(c) If succeed, add (m, v) to A, add v to 5, and push v onto T. 
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(d) Otherwise, remove u from T. 
4. Return edge set A. 

For any vertex u let 6+ the out-degree in the tree A produced by the algorithm. Then the total num- 
ber of queries spent in finding the neighbors of u is in the order of X]t=i \l dt /t\/\ogn which is in 
0{\/btdt logn). Summing over all vertices u this gives 




ybtdtlogn < bt / dt y/log n = 0{^/nm log 71). 



uev 

The first inequality follows from the general statement that the inner product of two vectors is upper bounded 
by the product of their ^2-norms. The second inequality uses the fact that a tree has only 0{n) edges. The 
algorithm spends in addition 0(^^ \/ dt log n) queries for the unsuccessful neighbor searches, but this is 
dominated by the previous cost. 

The overall error probability is upper bounded by since each of the at most 2n — 1 searches has error 
at most Trk-. □ 

20n 

Theoremll2limplies that Strong Connectivity, too, can be solved using an expected number of queries 
in 0(n^/^ yTogn) in the adjacency matrix model. In fact the previous algorithm can be used for the matrix 
model with m = n{n — 1). We first check that some fixed vertex vq can reach any other vertex u, producing 
some spanning tree rooted at vo ■ We then check that all vertices can reach vo by repeating the previous step 
on the transposed adjacency matrix. The two stages produces a set at most 2(n — 1) edges that proves the 
graph is strongly connected. 

This is not possible in array model, since we store for any vertex u only the neighbors at which edges 
are pointing, and there is no easy access to vertices which are connected with a directed edge to u. The 
following theorem circumvents this obstacle. 

Theorem 13 Given a directed graph G in the array model and a vertex uq, the algorithm below uses 
0{\/nm logn) queries and outputs an edge set E' Q E covering vq- If G is strongly connected, then with 
probability at least j E' is strongly connected. 

Proof: In a first stage we use the previous algorithm to construct a directed depth first spanning tree 
AC- E rooted in vq. Assume vertexes to be named according to the order they are added to T. 

Then in a second stage we search for every vertex Vi <E V, the neighbor Vj with smallest index. The result 
is a set of backward edges B C E. We claim that the graph G{V, E) is strongly connected iff its subgraph 
G'iy, A{J B) is strongly connected. 

Clearly if G" is strongly connected then so is G since A U i? C £'. Therefore to show the converse assume 
G strongly connected. For a proof by contradiction let Vi be the vertex with smallest index, which is not 
connected to wq in G' . However by assumption there is a path in G from Vi to uq- Let {vi,vii) be its first 
edge with I > i and /' < i. We use the following property of depth first search. 

Lemma 14 Let vi and vi' be two vertexes in the graph G with I < I' . If there is a path from vi 
to vii in G{V,E) then vii is in the subtree of G" {V, A) with root vi. 

Therefore we can replace in the original path the portion from Vi to vi by a path only using edges from 
A. Let vi" be the neighbor of vi with smallest index. Clearly /" < I' < i. By the choice of Vi, there exists a 
path from uj" to vq in G". Together this gives a path from Vi to vq in G' contradicting the assumption and 
therefore concluding the correctness of the algorithm. 

Now we analyze the complexity. During the first stage, set A is computed in time 0(\/ n-^/^ logn). The 
second stage can be done with 0{^nm) queries using the minima finding for the mapping from an edge 
number in [1, to] to the source-target vertex pair. Both stages can be made succeed with probability at least 
7/8. □ 
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6.1 The matrix model 



Here all we need is to construct an oriented tree, rooted in some vertex vq, and it not need to be depth- first. 
We want this tree to cover all vertices reachable by vq. There is a tricky method for constructing such a tree 
with bounded error, without a log-factor in the running time as the previous algorithm. 

The idea is to classify vertices covered by the current tree, into sets Tq, . . . ,Tq such that the confidence 
that vertices from Ti have no new neighbors is increasing with i. Whenever a search of an edge with 
u G R and v ^ To U ... U Tq is successful, for some subset R Ti, the vertices R and u will be moved into 
To, otherwise, R will be moved into Ti+i. We make it formal now. 

1. Let S be the tree consisting of the single vertex vq. 

Let be the partitioning of the vertex set covered by S into Tq = {vq} and Ti = . . . = Tg = {}, for 
q= Llog2(n)J +1. 

2. While there is a set Ti with |Ti| > 2* do 

(a) Let i be the smallest index such that \Ti\ > 2*. 

(b) If \Ti\ < 2*+\ i? = otherwise R is an arbitrary subset of with \R\ ^ 2\ 

(c) Remove R from Ti. 

(d) Search for an edge (m, v) with u E R and v ^ S in search space of size 0(2'n) with the version of 
the quantum search procedure which uses 0(2'^*/*-\/n) queries and find a solution with probability 
1 — 1/2^^'*^ provided such an edge exists. 

(e) If the search was successful, add {u, v) to S and R U {vq} to Tq, otherwise add R to Ti+i. 

3. Output S. 

Now we show some properties of the algorithm. For convenience we define = jTol + iT'il + ■ • • + 

Lemma 15 At the beginning and the end of an iteration we have the following invariant. Let k be the 
.smallest index and i be the largest index of a non-empty set Tj . Then 

\Te\ > 2^-1 (1) 
yk < j < e : tj > 2^ (2) 

Proof: by induction on the iterations of the algorithm. Initially, when Tq is the unique non-empty set, the 
claim holds. 

Assume the claim holds before an iteration. 

First observe that by the induction assumption ^ ii k < £ then \Tk\ > 2*^, and so the index chosen by 
the algorithm is always i = k. Now if the search was successful, we have already to > S'^, and therefore 
tj > 2^ for all < j < fc and for all j > A;, increased by one, preserving condition (jS)). If the search was 
not successful, then by the choice of i?, after the iteration either = or t^ > 2'' . For all values j > k, tj 
is not modified, preserving condition 

Condition ^ is preserved because after decreasing every set Tj satisfies either iTjl = or |Tj| > 2^, and 
because whenever a set Tj becomes non-empty, it contains at least 2^~^ elements. □ 

As a consequence, when the algorithm stops, there is a unique non-empty set Ti and moreover 2*~^ < 
\Ti\ < 2*. Also since at most n—1 searches can be successful, the algorithm stops after O(nlogn) iterations. 

Lemma 16 When the algorithm stops, S covers all vertices reachable by vq with probability at least 2/3. 

Proof: Suppose the algorithm failed so there is an edge (m, v) with u € S and v ^ S which was not found 
for each call to the search procedure with a set Tj containing u. The probability of this event is the product 
of the failure probability over all calls. It is roughly upper bounded by the failure probability of the last 
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call, which is at most pi 2Y2^^'^^, where i ~ \S\. Let qi be the probability that the algorithm outputs a 
vertex set S with \S\ ~ i. Then the failure probability is upper bounded by 

n 

QiPi < max Pi = pi < 1/3. 

^ — ^ i—O 

i=0 

□ 

Now we analyze the complexity of the algorithm. 
Lemma 17 The expected number of queries done by the algorithm is 0(|S'|y^). 

Proof: To analyze the total number of queries done by the search procedures, we group the calls of the 
search procedures into sequences of unsuccessful searches ending with a success, plus the last sequence of 
unsuccessful searches. 

For the first case, let (u, v) be an arbitrary edge found by the algorithm. Then the probability that it 
was found when u G is upper bounded by the probability that it was not found when u G Ti_i, which is 
1/2^^. The cost of this search and the i — 1 unsuccessful searches over sets R containing u is order of 

^ 23^/4 VH=0(23'/4V^). 
Therefore the expected cost of finding (w, v) is at most 




To complete the analysis we upper bound the total work of all the 0(log7i) unsuccessful searches which 
were made after the last successful search. Let i be such that 2'^^ < \S\ < 2*. There where at most 2Y2^ 
searches for sets R with \R\ = 2^. Therefore the total work is order of 

i 

1=0 

This concludes the proof. □ 

7 Single source shortest paths 

Let G be a directed graph with non- negative edge weights and a fixed vertex vq. We want to compute for 
every vertex v a shortest path from vq to v. It may happen that it is not unique. Using for example the 
lexicographical ordering on vertex sequences, we choose to compute a single canonical shortest path. From 
now on assume that different paths have different lengths. As a result, the union over all vertices v of the 
shortest paths from vq to w is a shortest path tree. Let w) be the weight of edge {u,v) and S{vo,v) the 
shortest path length from vq to v. 

Classically Single Source Shortest Path may be solved by Dijkstra's algorithm. It maintains a 
subtree T with the "shortest path subtree" invariant: for any vertex w G T. the shortest path from vq to v 
uses only vertices from T. An edge (u, v) is called a border edge (of T) if u G T and w ^ T, and u is called 
the source vertex, v the target vertex. The cost of (m, u) is S{vo,u) + i>{u,v). Dijkstra's algorithm starts 
with T = {vq} and iteratively adds the cheapest border edge to it. 

Our improvement lays in the selection of the cheapest border edge. We give the algorithm for the array 
model. Setting m = implies the required bound for the matrix model. 
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Theorem 18 The hounded error query complexity of single source shortest path in the array model is 
0{\/nm log"^^^ n). 

Proof: As in Dijkstra's algorithm wc construct iteratively a tree T, such that for every vertex v G T, the 
shortest path from vq to v is in T. Wc also maintain a partition of the vertices covered by T, into a set 
sequence. Its length is denoted by /. 

1. T = {vo},l^l,Pi={vo} 

2. Repeat until T covers the graph 

(a) For Pi compute up to cheapest border edges with disjoint target vertices. For this purpose 
set TV = J2v£Pi number all edges with source in Pi from 1 to A^. Define the functions 
/ : [N] N* and g : [N] V, where g{i) is target vertex of the i*"^ edge and /(«) is its weight if 
g{i) ^ T and oo otherwise. Apply the algorithm of sectionOon / and g with d = \Pi\ to find the 
d lowest cost edges with distinct target vertices. Let Ai be the resulting edge set. 

(b) Let {u,v) be the minimal weighted edge of AiU. . .U A; with v ^ PiU . . .Pi. Set T = T(j{{u,v)}, 
Pi+i = {v} and 1 = 1 + 1. 

(c) As long as / > 2 and li^f-il = merge Pi into and set 1 = 1 — 1. 

All steps but 2(b) constructed a vertex set sequence Pi, . . . ,Pi, the cardinality of each being a power of 
2, and of strictly decreasing sizes. Figure [2| shows an example of this partitioning of the vertices in T. 




Figure 2: Left: example of the set decomposition for |T| = 7 into powers of 2. Right: example of corre- 
sponding edge sets. The closest border edge of T belongs to one of ^i, ^2, ^3- 

Therefore each set Pi is strictly larger than the union of all the following sets, since X]i=o^ 2' = 2'^ — 1. 
If Ai contained \Pi\ edges, than at least one of them has its target vertex outside of Pi . . . , P;. Let (u, v) be 
the cheapest border edge of T. Let Pi be the vertex set containing u. Then Ai must contain this edge, and 
step 2(b) selects it. 

Only step 2(a) generates queries to the graph. What is the total number of queries related to sets Pi of 
some size s? There are at most n/s sets of this size s. Therefore total work is order of ^sm^ , where 

rrij is the number of edges with source in the j*^ vertex set. We have ™i ~ ™- This worst case is 

when rrii = sm/n. In that case the total work is 0{^/nm) for the fixed size s. There are logn different set 
sizes in the algorithm. Each of the O(nlogn) queries to the minimum finding procedures should succeed 
with probability 1 — l/2nlogn at least This introduces a O(logn) factor, see Remark^ and we obtain the 
claimed complexity. □ 

8 The lower bounds 

Theorem 19 The problems Find d smallest values of a function, Find d elements of different 
TYPE and Find d smallest values of different type require Q,{y/dN) queries. 
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Proof: For even k and odd d we consider d x k boolean matrices with a single in every row. It is encoded 
by a function / : [N] {0, 1} with N = kd, such that for every i G [d],j£ [k], f{id + j) is the entry in row 
i and column j. Let function g : [N] [d + 1] be such that g{id + j) maps to i if f{id + j) =0 and to d 
otherwise. 

So the problems of Finding d+1 smallest values of a function or d + 1 elements of different 
TYPE or d + 1 SMALLEST VALUES OF DIFFERENT TYPE are all equivalent to finding the positions of the d 
zeroes in the matrix. 

Let X be the set of matrices such that exactly ld/2\ rows have their in the first k/2 columns. And let 
Y be the set of matrices such that this number is exactly \d/2] . We show a lower bound for distinguishing 
X and Y. We say that matrix ^ e X is in relation with B E Y iS both matrices differ at exactly two entries. 
It follows that there are indices i S [d], < j < k/2 < j' < k with Ay = Biji = 1 and Aij/ = Bij = 0. The 
following example illustrates this definition. 





/ k/2 k/2 \ 




/ k/2 k/2 \ 


A = 




B = 


oiTnT 




HI oil 




110 111 




^ 111 101 J 




^ 111 101 / 



Then the number of matrices which are in relation with a fixed matrix is at least m = m' ~ \d/2]k/2. 
For fixed i,j the number of matrices in relation with M and differing at i,j is k/2 if A/y = and 1 if 
Mij = 1. So Zniax = k/2 and mm' /I mux) gives the required lower bound. □ 

Now, we give a simple lower bound for Connectivity (and Strong Connectivity) in the fist model, 
by a reduction from PARITY. As we recently found out, this reduction has first been used by Henzinger and 
Fredman for the on-line connectivity problem |14j . We show later how to improve this construction. 

Lemma 20 Strong Connectivity requires ^l{n) queries in the array model. 

Proof: We use a straightforward reduction from Parity. 

a;= 1 1 1 




V2 



Figure 3: A standard reduction from parity 



Let X G {0, 1}^ be an instance of the parity problem. We construct a permutation f onV ~ {wq, . . . , i'2p-i} 
which has exactly 1 or 2 cycles depending the parity of x. For any i G [p] and the bit b ^ Xi, we define 
f[v2i) = V2i+2+b and /(u2i+i) = W2i+3-b, whcrc addition is modulo 2p. See figure|3| The graph defined by 
/ has 2 levels and p columns, each corresponding to a bit of x. A directed walk starting at vertex vq will 
go from left to right, changing level whenever the corresponding bit in a; is 1. So when x is even the walk 
returns to vq while having explored only half of the graph, otherwise it returns to vi connecting from there 
again to vq by p more steps. Since the query complexity of Parity is 57(n) — see for example |12) — this 
concludes the proof. □ 

The same technique can be extended to the undirected case. 
Corollary 21 Connectivity requires fl{n) queries in the array model. 

We improve the lower bound in Lemma QUI bv changing the construction slightly. 
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Theorem 22 Strong Connectivity requires n{y/nm) queries in the array model. 

Proof: Let m be such that m ~ kn for some integer k. We construct the lower bound for graph with regular 
out-degree k. 

We use a similar construction as for Lemma 1201 but now for every vertex the fc — 1 additional edges 
are redirected back to an origin. We would like to connect them back to a fixed vertex uq, but this would 
generate multiple edges and we want the proof work for simple graphs. Therefore we connect them back to 
a fc-clique which then is connected to uq. See figure^] Let vertex set V = {wq, . . . , V2p-i, uq, . . . , Uk-i} for 
some integer p. In the list model, the edges are defined by a function f : V x [k] ^ V. We consider only 
functions with the following restrictions: 

For every i G [k] we have f{ui,0) = vq and for j € {1, . . . , fc — 1} f{ui,j) = Ui^j, where addition is 
modulo k. 

For every i e [p] there exist jo,ji G [k] and a bit b such that /(w2i, jo) = ^2^+2+!, and f{v2i+i, ji) = 
V2i+3-b-, where addition is modulo 2p this time. We call these edges the forward edges. The backward edges 



are for all j S [k] f{v2i,j) = Uj whenever j ^ jo and f{v2i+i,j) 



whenever j 7^ ji . 




Figure 4: A strongly connected graph 

Now all the vertices are connected to the fc-clique, the clique is connected to vq, and the graph is strongly 
connected if and only if there is a path from vq to vi. 

Let X be the set of functions which define a strongly connected graph, and Y the set of functions which 
do not. Function / S X is in relation with 9 G y if there are numbers i G [p\,ja,ji, ho, hi G [k] with jo 7^ /lo, 
ji 7^ hi such that the only places where / and g differ are 



giv2i, ho) = f{v2t+l,.jl) 
f{v2i,ho) = Uhg 



g{v2t+i,hi) = /(w2i,.7o) 
g{v2i+i,ji) = 

f{v2i+l, hi) ^ Uhi 



Informally / and g are in relation if there is a level, where the forward edges are exchanged between a parallel 
and crossing configuration and in addition the edge labels are changed. 

Then m = m' = 0{nk^), p G 0{n) for the number of levels and (fc — 1)^ for the number of possible 
forward edge labels. We also have lf.{v,j) = k—lif f{v,j) G {uq, . . . ,Uk-i} and lf\(v,j) = {k— 1)^ otherwise. 
The value is the same. Since only one of /(w, j), g{v, j) can be in {uo, . . . , Ufc-i} we have /max = 0{k^) 

and the lower bound follows. □ 

For the matrix model, there is a much simpler lower bound which works even for undirected graphs. 
Theorem 23 Connectivity requires f2(n^/^) queries in the matrix model. 
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Proof: We use Ambainis' method for the following special problem. You are given a symmetric matrix 
M G {0, 1}"-^" with the promise that it is the adjacency matrix of a graph with exactly one or two cycles, 
and have to find out which is the case. 

Let X be the set of all adjacency matrices of a unique cycle, and Y the set of all adjacency matrices with 
exactly two cycles each of length between n/3 and 2n/3. We define the relation RCXxYa.sMR M' if 
there exist a, b,c,d G [n] such that the only difference between M and M' is that (a, b), (c, d) are edges in M 
but not in M' and (a, c), (6, d) are edges on M' but not in M. See figure^ The definition of Y implies that 
in M the distance from a to c is between n/3 and 2n/3. 




Figure 5: Illustration of the relation 

Then m = 0{n^) since there are n — 1 choices for the first edge and n/3 choices for the second edge. Also 
m' = O(n^) since from each cycle one edge must be picked, and cycle length is at least n/3. 

We have ^m,(z,j") = 4 if Mij = since in M' we have the additional edge and the endpoints of the 
second edge must be neighbors of i and j respectively. Moreover lM,(i,j) = 0(n) if Mi^ = 1 since then 
is one of the edges to be removed and there remains n/3 choices for the second edge. 

The values 1'^^, ^.^ are similar, so in the product one factor will always be constant while the other is 
linear giving lM,{i.j)l'M' {% j) ~ ^l"-) ^^"^ the theorem follows. □ 

We give a lower bound for both minimum spanning tree and single-source shortest paths. 

Theorem 24 Finding minimum spanning tree and single source shortest paths require fl{y^nm) queries. 

Proof: The proof is a reduction from minima finding. Let m ~ fc(n+l) for some integer k. Let M be a matrix 
with n rows and k columns and positive entries. The lower bound on minima finding, with d ~ n, N ^ kn, 
shows that ri(V kn'^) queries are required to find the minimum value in every row. 



S 




Ui U2 Us Ua U5 

Figure 6: Reduction from finding minima to minimal spanning tree 

We construct a weighted graph G from M, like this: The vertices are V{G) ~ {s,vi, . . . ,Vk,ui, . . . , u„}. 
The edges are all {s,Vi) with weight and all {vi,Uj) with weight Mji. (See figure 0) Then clearly a 
minimum spanning tree contains the 0- weight edges connecting s to all vertices Vi. And every vertex Uj will 
be connected to the rest of the graph only with the minimal weighted edge. □ 
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